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PROTEASES FROM GRAM-POSITIVE ORGANISMS 

FIELD OF THE INVENTION 

The present invention relates to metallo-proteases derived from gram-positive 
microorganisms. The present invention provides nucleic acid and amino acid 
sequences of a metallo-protease identified in Bacillus subtilis. The present invention also 
provides methods for the production of the protease in host cells as well as the production of 
heterologous proteins in a host cell having a mutation or deletion of pan or all of the proteases 
of the present invention. 

BACKGROUND OF THE INVENTION 

Gram-positive microorganisms, such as members of the group Bacillus, have been used 
for large-scale industrial fermentation due, in part, to their ability to secrete their fermentation 
products into the culture media. In gram-positive bacteria, secreted proteins are exported 
across a cell membrane and a cell wall, and then are subsequently released into the external 
media usually maintaining their native conformation. 

Various gram-positive microorganisms are known to secrete extracellular and/or 
intracellular protease at some stage in their life cycles. Many proteases are produced in large 
quantities for industrial purposes. A negative aspect of the presence of proteases in gram- 
positive organisms is their contribution to the overall degradation of secreted heterologous or 
foreign proteins. 

The classification of proteases found in microorganisms is based on their catalytic 
mechanism which results in four groups: the serine proteases; metallo-proteases; cysteine 
proteases; and aspartic proteases. These categories, in general, can be distinguished by their 
sensitivity to various inhibitors. For example, the serine proteases are inhibited by 
phenylmethylsulfonylfluoride (PMSF) and diisopropylfluorophosphate (DIFP); the metallo- 
proteases by chelating agents; the cysteine enzymes by iodoacetamide and heavy metals and the 
aspartic proteases by pepstatin. The serine proteases have alkaline pH optima, the 
metalloproteases are optimally active around neutrality, and the cysteine and aspartic enzymes 
have acidic pH optima ^ Biotechnology Handbooks. Bacillus , vol. 2. edited by Harwood, 1989 

Plenum Press, New York). 

Metallo-proteases form the most diverse of the catalytic types of proteases. Family 
M23 contains bacterial enzymes such as the p-lytic endopeptidases oiLysobacter and 



WO 99/14342 



PCT/US98/18828 



„ 2 ~ 

Achromobacter and the Pseudomonas LasA protein and have specificity for Gly bonds, 
especially in Gly-Gly+Xaa-sequences (Methods in Enzymology, vol. 248, Academic Press, Inc. 
1994). The enzymes of the M23 family contain zinc and a conserved His-Xaa-His motif 

SUMMARY OF THE INVENTION 
The present invention relates to the discovery of a heretofore unknown metallo- 
protease (MP) found in gram positive microorganisms, uses of the MP in industrial 
applications, and advantageous strain improvements based on genetically engineering such 
microorganisms to delete, underexpress or overexpress that MP. Due to the overall 
relatedness of MP with Pseudomonas lasA protein, including the presence of the motif His- 
Xaa-His, MP appears to be a member of the metallo-protease family M23. 

Applicant's discovery, in addition to providing a new and useful protease and methods 
of detecting DNA encoding such proteases in a gram positive microorganism, provides several 
advantages which may facilitate optimization and/or modification of strains of gram positive 
microorganisms, such as Bacillus, for expression of desired, e.g. heterologous, proteins. Such 
optimizations, as described below in detail, allow the construction of strains having decreased 
proteolytic degradation of desired expression products. 

Applicant's invention is further based on the discovery of the presence of MP's in 
Gram-positive microorganisms. The Gram-positive microorganism may be Bacillus and may 
also be selected from the group consisting of Bacillus subtilis. Bacillus stearothermophilus, 
Bacillus licheniformis and Bacillus amyloliquifaciens. The present invention further relies on 
the discovery that naturally occurring MP is encoded by nucleic acid found about 2248 kb from 
the point of origin of Bacillus subtilis 1-168 strain (Bacillus Genetic Stock Center, accession 
number 1 Al, Columbus, Ohio). The present invention relates to the MP encoded thereby, as 
well as the nucleic acid and amino acid molecules having the sequences disclosed in Figures 
lA-lO. 

The present invention thus provides methods for detecting gram positive 
microorganism homologs of B. subtilis MP that comprises hybridizing part or all of the nucleic 
acid encoding B. subtilis MP with nucleic acid derived from gram-positive organisms, either of 
genomic or cDNA origin. Accordingly, the present invention provides a method for detecting a 
gram-positive microorganism MP, comprising the steps of hybridizing gram-positive 
microorganism nucleic acid under low stringency conditions to a probe, wherein the probe 
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comprises part or ail of the nucleic acid sequence shown in Figures 1 A-IO; and isolating gram- 
positive nucleic acid which hybridizes to said probe. 

In a preferred embodiment, the Bacillus is selected from the group consisting of B. 
Hcheniformis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus f B. 
amyloliquefaciens, B. coagulans, B. circulans, B. lautus and B. thuringiensis. 

The production of desired heterologous proteins or polypeptides in gram-positive 
microorganisms may be hindered by the presence of one or more proteases, including MP, 
which degrade the produced heterologous protein or polypeptide. One advantage of the 
present invention is that it provides methods and expression systems which can be used to 
prevent that degradation, thereby enhancing yields of the desired heterologous protein or 
polypeptide. Accordingly, the present invention provides a gram-positive microorganism 
having a mutation or deletion of part or all of the gene encoding MP, which results in the 
inactivation of the MP proteolytic activity, either alone or in combination with mutations in 
other proteases, such as apr, npr, epr, mpr, bpf or isp for example, or other proteases known to 
those of skill in the art. In one embodiment of the present invention, the gram-positive 
organism is a member of the genus Bacillus. In another embodiment, the Bacillus is selected 
from the group consisting of B. subtilis, B. licheniformis, B. lentus, B. brevis, B. 
stearothermophilus, B. alkalophilus, B. amyloliquefaciens, B. coagulans, B, circulans, B. 
lautus and Bacillus thuringiensis. In a further preferred embodiment, the Bacillus is Bacillus 
subtilis. 

In another aspect, the gram-positive host having one or more metallo-protease 
deletions or mutations is further genetically engineered to produce a desired protein. In one 
embodiment of the present invention, the desired protein is heterologous to the gram-positive 
host cell. In another embodiment, the desired protein is homologous to the host cell. The 
present invention encompasses a gram-positive host cell having a deletion, mutation or 
interruption of the nucleic acid encoding the naturally occurring homologous protein, such as a 
protease, and having nucleic acid encoding the homologous protein re-introduced in a 
recombinant form. In another embodiment, the host cell produces the homologous protein. 
Accordingly, the present invention also provides methods and expression systems for reducing 
degradation of heterologous proteins produced in gram-positive microorganisms. The gram- 
positive microorganism may be normally sporulating or non-sporulating. In a preferred 
embodiment, the gram positive host cell is a Bacillus. In another preferred embodiment, the 
Bacillus host cell is Bacillus. In another embodiment, the Bacillus is selected from the group 
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consisting of B. subtilis, B. licheniformis, B. lentus, B. brevis, B. siearothermophilus, B. 
alkalophilus, B. amyloliquefaciens, B. coagulans, B. circulans, B. lautus and Bacillus 
thuringiensis. 

Naturally occurring gram positive MP as well as proteolytically active amino acid 
variations or derivatives thereof, have application in the textile industry, in cleaning 
compositions and in animal feed. The metallo-protease MP may be used alone or in 
combination with other enzymes and/or mediators or enhancers. Accordingly, in a further 
aspect of the present invention, gram-positive MP is produced on an industrial fermentation 
scale in a microbial host expression system. The present invention provides a cleaning 
composition comprising a metalloprotease, MP, having the amino acid sequence shown in 
Figures 1 A-IO or the amino acid encoded by the MP nucleic acid found at about 2248 
kilobases from the point of origin of Bacillus si/btilis. Also provided are cleaning 
compositions comprising a metalloprotease having at least 80%, at least 90%, or at least 95% 
homology with the amino acid sequence shown in Figures 1 A-IO or comprising a 
metalloprotease encoded by a gene that hybridizes with the nucleic acid shown in Figures 1 A- 
10 under high stringency conditions. 

Further there is provided an animal feed comprising a metalloprotease, MP, having the 
amino acid sequence shown in Figures 1A-10. Also provided are animal feeds comprising a 
metalloprotease having at least 80%, at least 90%, and at least 95% homology with the amino 
acid sequence shown in Figures 1 A-IO or comprising a metalloprotease encoded by a gene 
that hybridizes with the nucleic acid shown in Figures 1A-10 under high stringency conditions. 

Also provided is a composition for the treatment of a textile comprising a 
metalloprotease, MP, having the amino acid sequence shown in Figures lA-lO. Also provided 
are compositions for the treatment of a textile comprising a metalloprotease having at least 
80%, at least 90%, or at least 95% homology with the amino acid sequence shown in Figures 
1 A-IO or comprising a metalloprotease encoded by a gene that hybridizes with the nucleic acid 
shown in Figures 1 A-IO under high stingency conditions. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A-IO shows the DNA and amino acid sequence for Bacillus subtilis MP. 

Figure 2 show an amino acid alignment of Bacillus subtilis MP (designated as YOMI) and 
Pseudomonas LasA. The amino acid motif H-X-H is noted at amino acid 308 -3 10 in LasA. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Definitions 

As used herein, the genus Bacillus includes all members known to those of skill in the 
art, including but not limited to B, subtilis, B. licheniformis, B. lentus, B. brevis, B. 
stearothermophilus, B. alkalophilus, B. amyloliquefaciens, B. coagulans, B. ciculans, B. 
lautus andB, thuringiensis. 

The present invention relates to a newly characterized metallo-protease (MP) from 
gram positive organisms. In a preferred embodiment, the metallo-protease is obtainable from a 
gram-positive organism which is a Bacillus. In another preferred embodiment, the metallo- 
protease is obtainable from a Bacillus which is selected from the group consisting of B. subtilis, 
B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus. B. 
amyloliquefaciens, B. coagulans, B. ciculans, B. lautus and B. thuringiensis. 

In another preferred embodiment, the gram-positive organism is Bacillus subtilis and 
MP has the amino acid sequence encoded by the nucleic acid molecule having the sequence 
that occurs around 2248 kilobases from the point of origin of Bacillus subtilis 1-168. 

In another preferred embodiment, Bacillus subtilis has the nucleic acid and amino acid 
sequence as shown in Figures I A-IO. The present invention encompasses the use of amino 
acid variations of the amino acid sequences disclosed in Figures 1 A-IO that have proteolytic 
activity. Such proteolytic amino acid variants can be used in the textile industry, animal feed 
and in cleaning compositions. The present invention also encompasses the use of B. subtilis 
amino acid variations or derivatives that are not proteolytically active. DNA encoding such 
variants can be used in methods designed to delete or mutate the naturally occurring host cell 
MP. 

As used herein, "nucleic acid" refers to a nucleotide or polynucleotide sequence, and 
fragments or portions thereof, and to DNA or RNA of genomic or synthetic origin which may 
be double-stranded or single-stranded, whether representing the sense or antisense strand. As 
used herein "amino acid" refers to peptide or protein sequences or portions thereof A 
"polynucleotide homolog" as used herein refers to a gram-positive microorganism 
polynucleotide that has at least 80%, at least 90% and at least 95% identity to B.subtilis MP, 
or which is capable of hybridizing to B.subtilis MP under conditions of high stringency and 
which encodes an amino acid sequence having metallo-protease activity. 
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The terms "isolated" or "purified" as used herein refer to a nucleic acid or amino acid 
that is removed from at least one component with which it is naturally associated. 

As used herein, the term "heterologous protein" refers to a protein or polypeptide that 
does not naturally occur in a gram-positive host cell. Examples of heterologous proteins 
include enzymes such as hydrolases including proteases, cellulases, amylases, carbohydrases, 
and lipases; isomerases such as racemases, epimerases, tautomerases, or mutases; transferases, 
kinases and phophatases. The heterologous gene may encode therapeutically significant 
proteins or peptides, such as growth factors, cytokines, ligands, receptors and inhibitors, as 
well as vaccines and antibodies. The gene may encode commercially important industrial 
proteins or peptides, such as proteases, carbohydrases such as amylases and glucoamyiases, 
cellulases, oxidases and lipases. The gene of interest may be a naturally occurring gene, a 
mutated gene or a synthetic gene. 

The term "homologous protein" refers to a protein or polypeptide native or naturally 
occurring in a gram-positive host cell. The invention includes host cells producing the 
homologous protein via recombinant DNA technology. The present invention encompasses a 
gram-positive host cell having a deletion or interruption of the nucleic acid encoding the 
naturally occurring homologous protein, such as a protease, and having nucleic acid encoding 
the homologous protein re-introduced in a recombinant form. In another embodiment, the host 
cell produces the homologous protein. 

As used herein, the term "overexpressing" when referring to the production of a 
protein in a host cell means that the protein is produced in greater amounts than its production 
in its naturally occurring environment. 

As used herein, the phrase "proteolytic activity" refers to a protein that is able to 
hydrolyze a peptide bond. Enzymes having proteolytic activity are described in Enzyme 
Nomenclature, 1 992, edited Webb Academic Press, Inc. 

DETATLED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The unexpected discovery of the metallo-protease M23 family member, designated 
herein as MP, found in translated, uncharacterized B.subtilis genomic sequences provides a 
basis for producing host cells, expression methods and systems which can be used to prevent 
the degradation of recombinantly produced heterologous proteins. 

Accordingly, in a preferred embodiment, the host cell is a gram-positive host cell that 
has a deletion or mutation in the naturally occurring nucleic acid encoding MP said mutation 
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resulting in deletion or inactivation of the production by the host cell of the MP proteolytic 
gene product. The host cell may additionally be genetically engineered to produced a desired 
protein or polypeptide. 

It may also be desired to genetically engineer host cells of any type to produce a gram- 
positive MP. Such host cells are used in large scale fermentation to produce large quantities of 
the protease which may be isolated or purified and used in cleaning products, such as 
detergents, in textile treatments and as animal feed additives. 

I MP Sequences 

The nucleic acid sequence and amino acid sequence for Bacillus subtilis MP are shown 
in Figures 1 A-10. As will be understood by the skilled artisan, due to the degeneracy of the 
genetic code, a variety of polynucleotides can encode the Bacillus subtilis MP having the 
amino acid sequence shown in Figures 1A-10. The present invention encompasses all such 
polynucleotides. 

The present invention encompasses the use of MP polynucleotide homologs encoding 
gram-positive microorganism MPs which have at least 80%, or at least 90% or at least 95% 
identity to B.subtilis MP shown in Figures 1 A-IO as long as the homolog encodes a protein 
that has proteolytic activity. 

Gram-positive polynucleotide homologs of B.subtilis MP may be obtained by standard 
procedures known in the art from, for example, cloned DNA (e.g., a DNA "library"), genomic 
DNA libraries, by chemical synthesis once identified, by cDNA cloning, or by the cloning of 
genomic DNA, or fragments thereof, purified from a desired cell. (See, for example, Sambrook et 
al, 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical 
Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II.) A preferred source is from genomic DNA. 

As will be understood by those of skill in the art, the polynucleotide sequence disclosed in 
Figures lA-lO may reflect inadvertent errors inherent to nucleic acid sequencing technology. 
Moreover, the sequence of polynucleotides derived from related species, e.g., other Bacillus, will 
contain variations to the sequences specifically disclosed herein. Nonetheless, one of ordinary skill 
in the art is folly capable of determining the correct sequences from the information provided 
herein regarding the invention. For example, as described below, it is possible to identify the MP 
of the invention by virtue of its location in the microorganism's genome. The present invention 
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encompasses the naturally occurring nucleic acid molecule having the nucleic acid sequence 
obtained from the genomic sequence of Bacillus species. 

Nucleic acid encoding Bacillus subtilis MP starts around 2248 kilobases counting from the 
point of origin in the Bacillus subtilis strain 1-168 (Anagnostopala, 1961, J. BacterioL 81:741-746 
or Bacillus Genomic Stock Center, accession 1 Al, Columbus, Ohio). The Bacillus subtilis point 
of origin has been described in Ogasawara, N. (1995, Microbiology 141:Pt.2 257-59). Bacillus 
subtilis MP has a length of 2285 amino acids. Based upon the location of the DNA encoding 
Bacillus subtilis MP, naturally occurring B.subtilis MP can be obtained by methods known to 
those of skill in the art including PCR technology. 

Oligonucleotide sequences or primers of about 10-30 nucleotides in length can be designed 
from the polynucleotide sequence disclosed in Figures 1 A- 10 and used in PCR technology to 
isolate the naturally occurring sequence from B. subtilis genomic sequences. 

Another general strategy for the "cloning" of B. subtilis genomic DNA pieces for sequencing 
uses inverse PCR. A known region is scanned for a set of appropriate restriction enzyme cleavage 
sites and inverse PCR is performed with a set of DNA primers determined from the outermost 
DNA sequence. The DNA fragments from the inverse PCR are directly used as template in the 
sequencing reaction. The newly derived sequences can be used to design new oligonucleotides. 
These new oligonucleotides are used to amplify DNA fragments with genomic DNA as template. 
The sequence determination on both strands of a DNA region is finished by applying a primer 
walking strategy on the genomic PCR fragments. The benefit of multiple starting points in the 
primer walking results from the series of inverse PCR fragments with different sizes of new 
"cloned" DNA pieces. From the most external DNA sequence a new round of inverse PCR is 
started. The whole inverse PCR strategy is based on the sequential use of conventional taq 
polymerase and the use of long range inverse PCR in those cases in which the taq polymerase 
failed to amplify DNA fragments. Nucleic acid sequencing is performed using standard 
technology. One method for nucleic acid sequencing involves the use of a Perkin-Elmer Applied 
Biosystems 373 DNA sequencer (Perkin-Elmer, Foster City, California). 

Nucleic acid sequences derived from genomic DNA may contain regulatory regions in 
addition to coding regions. Whatever the source, the isolated MP gene should be molecularly 
cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA fragments are generated, 
some of which will encode the desired gene. The DNA may be cleaved at specific sites using 
various restriction enzymes. Alternatively, one may use DNAse in the presence of manganese to 



WO 99/14342 



PCTYUS98/18828 



„ 9 „ 

fragment the DNA, or the DNA can be physically sheared, as for example, by sonication. The 
linear DNA fragments can then be separated according to size by standard techniques, including 
but not limited to, agarose and polyacrylamide gel electrophoresis and column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA fragment 
containing the MP may be accomplished in a number of ways. For example, a B.subtilis MP 
gene of the present invention or its specific RNA, or a fragment thereof, such as a probe or 
primer, may be isolated and labeled and then used in hybridization assays to detect a gram- 
positive MP gene. (Benton, W. and Davis, R., 1977. Science 196 :180: Grunstein, M. And 
Hogness, D., 1975, Proc, Natl. Acad. Sci USA 72:3961). Those DNA fragments sharing 
substantial sequence similarity to the probe will hybridize under stringent conditions. 

Accordingly, the present invention provides a method for the detection of gram- 
positive MP polynucleotide homologs which comprises hybridizing part or all of a nucleic acid 
sequence of B, subtilis MP with gram-positive microorganism nucleic acid of either genomic or 
cDNA origin. 

Also included within the scope of the present invention is the use of gram-positive 
microorganism polynucleotide sequences that are capable of hybridizing to the nucleotide 
sequence of B.subtiUs MP under conditions of intermediate to maximal stringency. 
Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding 
complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning Techniques. 
Methods in Enzvmologv. Vol 152. Academic Press, San Diego CA^ incorporated herein by 
reference, and confer a defined "stringency" as explained below. 

"Maximum stringency" typically occurs at about Tm-5°C (5°C below the Tm of the 
probe); "high stringency" at about 5°C to 10°C below Tm; "intermediate stringency" at about 
10°C to 20°C below Tm; and "low stringency" at about 20°C to 25°C below Tm. As will be 
understood by those of skill in the art, a maximum stringency hybridization can be used to 
identify or detect identical polynucleotide sequences while an intermediate or low stringency 
hybridization can be used to identify or detect polynucleotide sequence homologs. 

The term "hybridization" as used herein shall include "the process by which a strand of 
nucleic acid joins with a complementary strand through base pairing" (Coombs J (1994) 
Dictionary of Biotechnology, Stockton Press, New York NY). 

The process of amplification as carried out in polymerase chain reaction (PCR) 
technologies is described in Dieffenbach CW and GS Dveksler (1995, PCR Primer, a 
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Laboratory Manual, Cold Spring Harbor Press, Plainview NY). A nucleic acid sequence of at 
least about 10 nucleotides and as many as about 60 nucleotides from B. subtilis MP preferably 
about 12 to 30 nucleotides, and more preferably about 20-25 nucleotides can be used as a 
probe or PCR primer. 

TheB.subtilis MP amino acid sequences (shown in Figures lA-lO) were identified via 
a BLAST search (Altschul, Stephen, Basic local alignment search tool, J. Mol. Biol. 215:403- 
410) of Bacillus subtilis genomic nucleic acid sequences. B. subtilis MP (YOMI) was 
identified by its overall nucleic acid identity to the metallo-protease, Pseudomonas lasA, 
including the presence of the catalytic domain H-X-H as shown in Figure 2. 

II. Expression Systems 

The present invention provides host cells, expression methods and systems for the 
enhanced production and secretion of desired heterologous or homologous proteins in gram- 
positive microorganisms. In one embodiment, a host cell is genetically engineered to have a 
deletion or mutation in the gene encoding a gram-positive MP such that the respective activity 
is deleted-. In another embodiment of the present invention, a gram-positive microorganism is 
genetically engineered to produce a metallo-protease of the present invention. 

Inactivation of a gram-positive metallo-protease in a host cell 

Producing an expression host cell incapable of producing the naturally occurring 
metallo-protease necessitates the replacement and/or inactivation of the naturally occurring 
gene from the genome of the host cell. In a preferred embodiment, the mutation is a non- 
reverting mutation. 

One method for mutating nucleic acid encoding a gram-positive metallo-protease is to 
clone the nucleic acid or part thereof, modify the nucleic acid by site directed mutagenesis and 
reintroduce the mutated nucleic acid into the cell on a plasmid. By homologous recombination, 
the mutated gene may be introduced into the chromosome. In the parent host cell, the result is 
that the naturally occurring nucleic acid and the mutated nucleic acid are located in tandem on 
the chromosome. After a second recombination, the modified sequence is left in the 
chromosome having thereby effectively introduced the mutation into the chromosomal gene for 
progeny of the parent host cell. 

Another method for inactivating the metallo-protease proteolytic activity is through 
deleting the chromosomal gene copy. In a preferred embodiment, the entire gene is deleted, 
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the deletion occurring in such as way as to make reversion impossible. In another preferred 
embodiment, a partial deletion is produced, provided that the nucleic acid sequence left in the 
chromosome is too short for homologous recombination with a plasmid encoded metallo- 
protease gene. In another preferred embodiment, nucleic acid encoding the catalytic amino 
acid residues are deleted. 

Deletion of the naturally occurring gram-positive microorganism metallo-protease can 
be carried out as follows. A metallo-protease gene including its 5' and 3' regions is isolated 
and inserted into a cloning vector. The coding region of the metallo-protease gene is deleted 
form the vector in vitro, leaving behind a sufficient amount of the 5' and 3' flanking sequences 
to provide for homologous recombination with the naturally occurring gene in the parent host 
cell. The vector is then transformed into the gram-positive host cell. The vector integrates 
into the chromosome via homologous recombination in the flanking regions. This method 
leads to a gram-positive strain in which the protease gene has been deleted. 

The vector used in an integration method is preferably a plasmid. A selectable marker 
may be included to allow for ease of identification of desired recombinant microorgansims. 
Additionally, as will be appreciated by one of skill in the art, the vector is preferably one which 
can be selectively integrated into the chromosome. This can be achieved by introducing an 
inducible origin of replication, for example, a temperature sensitive origin into the plasmid. By 
growing the transformants at a temperature to which the origin of replication is sensitive, the 
replication function of the plasmid is inactivated, thereby providing a means for selection of 
chromosomal integrants. Integrants may be selected for growth at high temperatures in the 
presence of the selectable marker, such as an antibiotic. Integration mechanisms are described 
in WO 88/06623. 

Integration by the Campbell-type mechanism can take place in the 5' flanking region of 
the protease gene, resulting in a protease positive strain carrying the entire plasmid vector in 
the chromosome in the metallo-protease locus. Since illegitimate recombination will give 
different results it will be necessary to determine whether the complete gene has been deleted, 
such as through nucleic acid sequencing or restriction maps. 

Another method of inactivating the naturally occurring metallo-protease gene is to 
mutagenize the chromosomal gene copy by transforming a gram-positive microorganism with 
oligonucleotides which are mutagenic. Alternatively, the chromosomal metallo-protease gene 
can be replaced with a mutant gene by homologous recombination. 
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The present invention encompasses host cells having additional protease deletions or 
mutations, such as deletions or mutations in apr, npr, epr, mpr and others known to those of 
skill in the art. 

One assay for the detection of mutants involves growing the Bacillus host cell on 
medium containing a protease substrate and measuring the appearance or lack thereof, of a 
zone of clearing or halo around the colonies. Host cells which have an inactive protease will 
exhibit little or no halo around the colonies. 

III. Production of Metallo-protease 

For production of metallo-protease in a host cell, an expression vector comprising at 
least one copy of nucleic acid encoding a gram-positive microorganism MP, and preferably 
comprising multiple copies, is transformed into the host cell under conditions suitable for 
expression of the metallo-protease. In accordance with the present invention, polynucleotides 
which encode a gram-positive microorganism MP, or fragments thereof, or fusion proteins or 
polynucleotide homolog sequences that encode amino acid variants of B.subtilis MP, may be 
used to generate recombinant DNA molecules that direct their expression in host cells. In a 
preferred embodiment, the gram-positive host cell belongs to the genus Bacillus. In another 
preferred embodiment, the gram positive host cell is B. subtilis. 

As will be understood by those of skill in the art, it may be advantageous to produce 
polynucleotide sequences possessing non-naturally occurring codons. Codons preferred by a 
particular gram-positive host cell (Murray E et al (1989) Nuc Acids Res 17:477-508) can be 
selected, for example, to increase the rate of expression or to produce recombinant RNA 
transcripts having desirable properties, such as a longer half-life, than transcripts produced 
from naturally occurring sequence. 

Altered MP polynucleotide sequences which may be used in accordance with the 
invention include deletions, insertions or substitutions of different nucleotide residues resulting 
in a polynucleotide that encodes the same or a functionally equivalent MP homolog, 
respectively. As used herein a "deletion" is defined as a change in either nucleotide or amino 
acid sequence in which one or more nucleotides or amino acid residues, respectively, are 
absent. 

As used herein an "insertion" or "addition" is that change in a nucleotide or amino acid 
sequence which has resulted in the addition of one or more nucleotides or amino acid residues, 
respectively, as compared to the naturally occurring MP. 
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As used herein "substitution" results from the replacement of one or more nucleotides 
or amino acids by different nucleotides or amino acids, respectively. 

The encoded protein may also show deletions, insertions or substitutions of amino acid 
residues which produce a silent change and result in a functionally equivalent MP variant. 
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long 
as the variant retains the ability to modulate secretion. For example, negatively charged amino 
acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and 
arginine; and amino acids with uncharged polar head groups having similar hydrophilicity 
values include leucine, isoleucine, valine; glycine, alanine; asparagine, glutamine; serine, 
threonine, phenylalanine, and tyrosine. 

The MP polynucleotides of the present invention may be engineered in order to modify 
the cloning, processing and/or expression of the gene product. For example, mutations may be 
introduced using techniques which are well known in the art, eg, site-directed mutagenesis to 
insert new restriction sites, to alter glycosylation patterns or to change codon preference, for 
example. 

In one embodiment of the present invention, a gram-positive microorganism MP 
polynucleotide may be ligated to a heterologous sequence to encode a fusion protein. A fusion 
protein may also be engineered to contain a cleavage site located between the metallo-protease 
nucleotide sequence and the heterologous protein sequence, so that the metallo-protease may 
be cleaved and purified away from the heterologous moiety. 

IV. Vector Sequences 

Expression vectors used in expressing the metallo-proteases of the present invention in 
gram-positive microorganisms comprise at least one promoter associated with a metallo- 
protease selected from the group consisting of MP, which promoter is functional in the host 
cell. In one embodiment of the present invention, the promoter is the wild-type promoter for 
the selected metallo-protease and in another embodiment of the present invention, the 
promoter is heterologous to the metallo-protease, but still functional in the host cell. In one 
preferred embodiment of the present invention, nucleic acid encoding the metallo-protease is 
stably integrated into the microorganism genome. 

In a preferred embodiment, the expression vector contains a multiple cloning site 
cassette which preferably comprises at least one restriction endonuclease site unique to the 
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vector, to facilitate ease of nucleic acid manipulation. In a preferred embodiment, the vector 
also comprises one or more selectable markers. As used herein, the term selectable marker 
refers to a gene capable of expression in the gram-positive host which allows for ease of 
selection of those hosts containing the vector. Examples of such selectable markers include but 
are not limited to antibiotics, such as, erythromycin, actinomycin, chloramphenicol and 
tetracycline. 

V. Transformation 

A variety of host cells can be used for the production Bacillus subtilis MP or MP 
homologs including bacterial, fungal, mammalian and insects cells. General transformation 
procedures are taught in Current Protocols In Molecular Biology (vol. 1, edited by Ausubel et 
al., John Wiley & Sons, Inc. 1987, Chapter 9) and include calcium phosphate methods, 
transformation using DEAE-Dextran and electroporation. Plant transformation methods are 
taught in Rodriquez (WO 95/14099, published 26 May 1995). 

In a preferred embodiment, the host cell is a gram-positive microorganism and in 
another preferred embodiment, the host cell is Bacillus. In one embodiment of the present 
invention, nucleic acid encoding one or more MP(s) of the present invention is introduced into 
a host cell via an expression vector capable of replicating within the Bacillus host cell. Suitable 
replicating piasmids for Bacillus are described in Molecular Biological Methods for Bacillus, 
Ed. Harwood and Cutting, John Wiley & Sons, 1990, hereby expressly incorporated by 
reference; see chapter 3 on piasmids. Suitable replicating piasmids for B. subtilis are listed on 
page 92. 

In another embodiment, where it is desired to produce the MP for use in cleaning 
compositions, nucleic acid encoding MP is stably integrated into the microorganism genome. 
Preferred host cells are gram-positive host cells. Another preferred host is Bacillus. Another 
preferred host is Bacillus subtilis. Several strategies have been described in the literature for 
the direct cloning of DNA in Bacillus. Plasmid marker rescue transformation involves the 
uptake of a donor plasmid by competent cells carrying a partially homologous resident plasmid 
(Contente et al., Plasmid 2:555-571 (1979); Haima et al, Mol. Gen. Genet. 223:185-191 
(1990); Weinrauch et al., J. Bacteriol. 154(3):1077-1087 (1983); and Weinrauch et al., J. 
Bacterid. 169(3):1205-121 1 (1987)). The incoming donor plasmid recombines with the 
homologous region of the resident "helper" plasmid in a process that mimics chromosomal 
transformation. 
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Protoplast transformation is described for B. subtilis\n Chang and Cohen, (1979) Mol. 
Gen. Genet 168: 1 1 1-1 15; for B.megaterium in Vorobjeva et al, (1980) FEMS Microbiol. 
Letters 7:261-263; for BMmyloliquefaciens in Smith et al., (1986) Appl. and Env. Microbiol. 
51:634; for B.thuringiensis in Fisher et al., (1981) Arch. Microbiol. 139:213-217; for 
B.sphaericus in McDonald (1984) J. Gen. Microbiol. 130:203; and B.larvae in Bakhiet et al., 
(1985, Appl. Environ. Microbiol. 49:577). Mann et al., (1986, Current Microbiol. 13:131- 
135) report on transformation of Bacillus protoplasts and Holubova, (1985) Folia Microbiol 
30:97) disclose methods for introducing DNA into protoplasts using DNA containing 
liposomes. 

VI. Identification of Transformants 

Whether a host cell has been transformed with a mutated or a naturally occurring gene 
encoding a gram-positive MP, detection of the presence/absence of marker gene expression 
can suggest whether the gene of interest is present However, its expression should be 
confirmed. For example, if the nucleic acid encoding an MP of the present invention is inserted 
within a marker gene sequence, recombinant cells containing the insert can be identified by the 
absence of marker gene function. Alternatively, a marker gene can be placed in tandem with 
nucleic acid encoding the MP under the control of a single promoter. Expression of the marker 
gene in response to induction or selection usually indicates expression of the MP as well. 

Alternatively, host cells which contain the coding sequence for a metallo-protease and 
express the protein may be identified by a variety of procedures known to those of skill in the 
art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridization 
and protein bioassay or immunoassay techniques which include membrane-based, solution- 
based, or chip-based technologies for the detection and/or quantification of the nucleic acid or 
protein. 

The presence of the metallo-protease polynucleotide sequence can be detected by 
DNA-DNA or DNA-RNA hybridization or amplification using probes, portions or fragments 
of B.subtilis MP. 

VII Assay of Protease Activity 

There are various assays known to those of skill in the art for detecting and measuring 
protease activity. There are assays based upon the release of acid-soluble peptides from casein 
or hemoglobin measured as absorbance at 280 nm or colorimetrically using the Folin method 
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(Bergmeyer, et al., 1984, Methods of Enzymatic Analysis vol. 5, Peptidases, Proteinases and 
their Inhibitors, Verlag Chemie, Weinheim). Other assays involve the solubilization of 
chromogenic substrates (Ward, 1983, Proteinases, in Microbial Enzymes and Biotechnology 
(W.M. Fogarty, ed.), Applied Science, London, pp. 251-317). 

VIII Secretion of Recombinant Proteins 

Means for determining the levels of secretion of a heterologous or homologous protein 
in a gram-positive host cell and detecting secreted proteins include, using either polyclonal or 
monoclonal antibodies specific for the protein. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated cell sorting 
(FACS). These and other assays are described, among other places, in Hampton R et al (1990, 
Serological Methods, a Laboratory Manual, APS Press, St Paul MN) and Maddox DE et al 
(1983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by those skilled in the 
an and can be used in various nucleic and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting specific polynucleotide sequences include 
oligolabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide. 
Alternatively, the nucleotide sequence, or any portion of it, may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA 
polymerase such as T7, T3 or SP6 and labeled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscataway NJ), Promega 
(Madison WI), and US Biochemical Corp (Cleveland OH) supply commercial kits and 
protocols for these procedures. Suitable reporter molecules or labels include those 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as 
substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the use of 
such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275, 149 and 4,366,241 . Also, recombinant immunoglobulins may be produced as shown in 
US Patent No. 4,816,567 and incorporated herein by reference. 

IX Purification of Proteins 

Gram positive host cells transformed with polynucleotide sequences encoding 
heterologous or homologous protein may be cultured under conditions suitable for the 
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expression and recovery of the encoded protein from cell culture. The protein produced by a 
recombinant gram-positive host cell comprising a mutation or deletion of the metallo-protease 
activity will be secreted into the culture media. Other recombinant constructions may join the 
heterologous or homologous polynucleotide sequences to nucleotide sequence encoding a 
5 polypeptide domain which will facilitate purification of soluble proteins (Kroll DJ et al (1993) 
DNA Cell Biol 12:441-53). 

Such purification facilitating domains include, but are not limited to, metal chelating 
peptides such as histidine-tryptophan modules that allow purification on immobilized metals 
(Porath J (1992) Protein Expr Purif 3:263-281), protein A domains that allow purification on 
io immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 

purification system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker sequence 
O such as Factor XA or enterokinase (Invitrogen, San Diego C A) between the purification 
y3 domain and the heterologous protein can be used to facilitate purification. 

f^is X Uses of The Present Invention 

| ; y MP and Genetically Engineered Host Cells 

^ The present invention provides genetically engineered host cells comprising mutations, 

08 preferably non-revertable mutations, or deletions in the naturally occurring gene encoding MP 
^| such that the proteolytic activity is diminished or deleted altogether. The host cell may contain 
additional protease deletions, such as deletions of the mature subtilisn protease and/or mature 
neutral protease disclosed in United States Patent No. 5,264,366. 

In a preferred embodiment, the host cell is further genetically engineered to produce a 
desired protein or polypeptide. In a preferred embodiment the host cell is a Bacillus. In 
another preferred embodiment, the host cell is a Bacillus subtilis. 
25 In an alternative embodiment, a host cell is genetically engineered to produce a gram- 

positive MP. In a preferred embodiment, the host cell is grown under large scale fermentation 
conditions. In another preferred embodiment, the MP is isolated and/or purified and used in 
the textile industry, the feed industry and in cleaning compositions such as detergents. 

As noted, MP can be useful in formulating various cleaning compositions. A number of 
so known compounds are suitable surfactants useful in compositions comprising the MP of the 
invention. These include nonionic, anionic, cationic, anionic or zwitterionic detergents, as 
disclosed in US 4,404, 128 and US 4,261,868. A suitable detergent formulation is that 
described in Example 7 of US Patent 5,204,015. The an is familiar with the different 
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formulations which can be used as cleaning compositions. In addition, MP can be used, for 
example, in bar or liquid soap applications, dishcare formulations, contact lens cleaning 
solutions or products, peptide hydrolysis, waste treatment, textile applications, as fiision- 
cleavage enzymes in protein production, etc. MP may comprise enhanced performance in a 
detergent composition (as compared to another detergent protease). As used herein, enhanced 
performance in a detergent is defined as increasing cleaning of certain enzyme sensitive stains 
such as grass or blood, as determined by usual evaluation after a standard wash cycle. 

MP can be formulated into known powdered and liquid detergents having pH between 
6.5 and 12.0 at levels of about .01 to about 5% (preferably .1% to .5%) by weight. These 
detergent cleaning compositions can also include other enzymes such as known proteases, 
amylases, cellulases, lipases or endoglycosidases, as well as builders and stabilizers. 

The addition of MP to conventional cleaning compositions does not create any special 
use limitation. In other words, any temperature and pH suitable for the detergent is also 
suitable for the present compositions as long as the pH is within the above range, and the 
temperature is below the described MP's denaturing temperature. In addition, MP can be used 
in a cleaning composition without detergents, again either alone or in combination with 
builders and stabilizers. 

Proteases can be included in animal feed such as part of animal feed additives as 
described in, for example, US 5,612,055; US 5,314,692; and US 5,147,642. 

One aspect of the invention is a composition for the treatment of a textile that includes 
MP. The composition can be used to treat for example silk or wool as described in 
publications such as RD 216,034; EP 134,267; US 4,533,359; and EP 344,259. 

MP Polynucleotides 

A B.subtlis MP polynucleotide, or any part thereof, provides the basis for detecting the 
presence of gram-positive microorganism MP polynucleotide homologs through hybridization 
techniques and PCR technology. 

Accordingly, one aspect of the present invention is to provide for nucleic acid 
hybridization and PCR probes which can be used to detect polynucleotide sequences, including 
genomic and cDNA sequences, encoding gram-positive MP or portions thereof. In another 
aspect of the present invention, an MP polynucleotide can be used in hybridization technology 
to detect the major protease of a gram-positive microorganism due to the proximity of the MP 
with the major protease. 
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The manner and method of carrying out the present invention may be more fully 
understood by those of skill in the art by reference to the following examples, which examples 
are not intended in any manner to limit the scope of the present invention or of the claims 
directed thereto 

Example I 

Preparation of a Genomic library 

The following example illustrates the preparation of a Bacillus genomic library. 

Genomic DNA from Bacillus cells is prepared as taught in Current Protocols In 
Molecular Biology vol. 1, edited by Ausubel et al., John Wiley & Sons, Inc. 1987, chapter 2. 
4. 1 . Generally, Bacillus ceils from a saturated liquid culture are lysed and the proteins 
removed by digestion with proteinase K. Cell wall debris, polysaccharides, and remaining 
proteins are removed by selective precipitation with CTAB, and high molecular weight 
genomic DNA is recovered from the resulting supernatant by isopropanol precipitation. If 
exceptionally clean genomic DNA is desired, an additional step of purifying the Bacillus 
genomic DNA on a cesium chloride gradient is added. 

After obtaining purified genomic DNA, the DNA is subjected to Sau3 A digestion. 
Sau3 A recognizes the 4 base pair site GATC and generates fragments compatible with several 
convenient phage lambda and cosmid vectors. The DNA is subjected to partial digestion to 
increase the chance of obtaining random fragments. 

The partially digested Bacillus genomic DNA is subjected to size fractionation on a 1% 
agarose gel prior to cloning into a vector. Alternatively, size fractionation on a sucrose 
gradient can be used. The genomic DNA obtained from the size fractionation step is purified 
away from the agarose and ligated into a cloning vector appropriate for use in a host cell and 
transformed into the host cell. 

Example II 

Detection of gram-positive microorganisms 
The following example describes the detection of gram-positive microorganism MP. 
DNA derived from a gram-positive microorganism is prepared according to the 
methods disclosed in Current Protocols in Molecular Biology, Chap. 2 or 3. The nucleic acid 
is subjected to hybridization and/or PCR amplification with a probe or primer derived from 
MP. 
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The nucleic acid probe is labeled by combining 50 pmol of the nucleic acid and 250 mCi 
of [gamma P] adenosine triphosphate (Amersham, Chicago IL) and T4 polynucleotide 
kinase (DuPont NEN®, Boston MA). The labeled probe is purified with Sephadex G-25 super 
fine resin column (Pharmacia). A portion containing 10 7 counts per minute of each is used in a 
typical membrane based hybridization analysis of nucleic acid sample of either genomic or 
cDNA origin. 

The DNA sample which has been subjected to restriction endonuclease digestion is 
fractionated on a 0.7 percent agarose gel and transferred to nylon membranes (Nytran Plus, 
Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 hours at 40 degrees C. 
To remove nonspecific signals, blots are sequentially washed at room temperature under 
increasingly stringent conditions up to 0. 1 x saline sodium citrate and 0.5% sodium dodecyl 
sulfate. The blots are exposed to film for several hours, the film developed and hybridization 
patterns are compared visually to detect polynucleotide homologs of B.subtilis MP. The 
homologs are subjected to confirmatory nucleic acid sequencing. Methods for nucleic acid 
sequencing are well known in the art. Conventional enzymatic methods employ DNA 
polymerase Klenow fragment, SEQUENASE® (US Biochemical Corp, Cleveland, OH) or Taq 
polymerase to extend DNA chains from an oligonucleotide primer annealed to the DNA 
template of interest. 

Various other examples and modifications of the foregoing description and examples 
will be apparent to a person skilled in the art after reading the disclosure without departing 
from the spirit and scope of the invention, and it is intended that all such examples or 
modifications be included within the scope of the appended claims. All publications and 
patents referenced herein are hereby incorporated by reference in their entirety. 



